Claims

1. Image processing device that can receive and code image data, characterized by the 
fact that it comprises a dividing means that divides the aforementioned input image data into 
predetermined block units, a transform means that applies an orthogonal transform to each of the 
aforementioned blocks, an identification means that identifies the image characteristics of each of 
the aforementioned blocks using the transform coefficient obtained from the orthogonal 
transform by the aforementioned transform means, an acquisition means that acquires the 
optimum quantization weights for each of the aforementioned blocks according to the 
identification result by the aforementioned identification means, a quantization means that 
quantizes the aforementioned transform coefficient using the quantization weights acquired by 
the aforementioned acquisition means, and a coding means that codes the transform coefficient 
quantized by the aforementioned quantization means. 

2. Image processing device in accordance with Claim 1, characterized by the fact that the
aforementioned acquisition means comprises several quantization tables, and according to the 
identification result by the aforementioned identification means, selects the quantization table 
that has the optimum quantization weights. 

3. Image processing device in accordance with Claim 1, characterized by the fact that the
aforementioned acquisition means has one quantization table, and according to the identification
result by the aforementioned identification means, acquires the optimum quantization weights by
multiplying the quantization weights in the aforementioned quantization table by a prescribed
constant.

4. Image processing device in accordance with Claim 1, characterized by the fact that
using the aforementioned image characteristic, each of the aforementioned blocks is identified as 
a block with letters or a block with natural objects or backgrounds. 

5. Image processing device in accordance with Claim 1, characterized by the fact that the 
aforementioned image characteristic is represented by one or more bits. 

6. Image processing device in accordance with Claim 1, characterized by the fact that the
aforementioned image characteristic is determined by using prescribed frequency components of 
the aforementioned transform coefficients. 

7. Image processing device in accordance with Claim 1, characterized by the fact that the 
bits that represent the aforementioned image characteristic are coded by entropy coding. 
Detailed explanation of the invention
[0001] 

Industrial application field 

The present invention pertains to an image processing device. In particular, it pertains to
an image processing device that codes image data.

[0002] 
Prior art 

A prior art system known for high efficiency compression coding of color still images is 
the JPEG system (Joint Photographic Experts Group, Vol. 20, No. 1, 1991, pp. 50-58), which is 
currently being internationally standardized. 

[0003] 

Figure 10 is a block diagram that illustrates the processing procedure of the JPEG system. 

[0004] 

In the JPEG system, the input image data is first sampled and then divided into blocks of
8 x 8 pixels, and next, the two-dimensional discrete cosine transform (hereafter called DCT) is
executed (121 in Figure 10). Then, for the obtained 8 x 8 DCT transform coefficients, normalized
quantization is performed with weights that differ at each coefficient position using a
quantization table (8 x 8 entries) (122 in Figure 10). Finally, of the quantized DCT transform
coefficients, for DC coefficients, the difference from the DC coefficient of the previous block
with the same color component is obtained, and Huffman coding is applied. For AC coefficients,
zigzag reordering is first applied to convert the block to one dimension, and entropy coding
(two-dimensional Huffman coding) is then performed on pairs consisting of the run length of
consecutive "0" coefficients and the following effective coefficient other than "0" (123 in
Figure 10). The obtained code is then output as the coded data.
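
The final coding stage described above can be sketched in a few lines. The following is a minimal illustration only, not the JPEG reference procedure: the function names are hypothetical, the Huffman tables themselves are omitted, and the special code JPEG uses for zero runs of 16 or more is not modeled.

```python
def zigzag_order(n=8):
    """Return the (row, col) positions of an n x n block in zigzag scan order."""
    # Positions are grouped by anti-diagonal (row + col); odd diagonals are
    # walked with the row increasing, even diagonals with the column increasing.
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda rc: (rc[0] + rc[1], rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )


def dc_differences(dc_values):
    """Difference of each DC coefficient from that of the previous block."""
    previous, diffs = 0, []
    for dc in dc_values:
        diffs.append(dc - previous)
        previous = dc
    return diffs


def run_level_pairs(quantized_block):
    """Zigzag-scan the AC coefficients and emit (zero_run, value) pairs."""
    order = zigzag_order(len(quantized_block))
    ac = [quantized_block[r][c] for r, c in order[1:]]  # skip the DC term
    pairs, run = [], 0
    for value in ac:
        if value == 0:
            run += 1
        else:
            pairs.append((run, value))
            run = 0
    if run:
        pairs.append("EOB")  # trailing zeros collapse into an end-of-block marker
    return pairs
```

Each `(zero_run, value)` pair is the symbol that the two-dimensional Huffman coder would then encode; a block whose high-frequency coefficients were quantized to zero produces few pairs, which is why larger quantization weights reduce the amount of code.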

[0005] 

Problem to be solved by the invention 

However, the aforementioned conventional example has a problem. To increase the
coding efficiency, that is, to reduce the amount of code, the weights in the quantization
table are increased. As a result, the high-frequency components of the AC
coefficients in particular are cut off by the quantization process. Therefore, if an image with both
letters and natural objects is coded, deterioration in picture quality (mosquito noise, etc.),
particularly around the letters, is apparent.



[0006]

The present invention was made considering the aforementioned conventional example.
The purpose is to offer an image processing device with little deterioration in picture quality
even in the processing of images with, for instance, letters and natural objects.

[0007] 

Means to solve the problem 

To realize the aforementioned purpose, the image processing device of the present
invention adopts the following constitution.

[0008]

That is, it is an image processing device that can receive and code image
data, characterized by the fact that it comprises a dividing means that divides the aforementioned
input image data into predetermined block units, a transform means that applies an orthogonal 
transform to each of the aforementioned blocks, an identification means that identifies the image 
characteristics of each of the aforementioned blocks using the transform coefficient obtained 
from the orthogonal transform by the aforementioned transform means, an acquisition means that 
acquires the optimum quantization weights for each of the aforementioned blocks according to 
the identification result by the aforementioned identification means, a quantization means that 
quantizes the aforementioned transform coefficient using the quantization weights acquired by 
the aforementioned acquisition means, and a coding means that codes the transform coefficient 
quantized by the aforementioned quantization means. 

[0009] 

Operation

According to the aforementioned constitution, the present invention functions as follows: 
The image characteristic of each block is identified from the transform coefficient for the block 
to which an orthogonal transform has been applied, and the quantization of the aforementioned 
transform coefficient is made by obtaining the optimum quantization weights according to the 
identification result. 

[0010] 

Application examples

Next, appropriate application examples of the present invention will be explained in
detail with reference to the attached drawings.



[0011] 

Figure 1 is a block diagram that illustrates the scheme of the entire constitution of the 
image processing device, which is a representative application example of the present invention. 

[0012]

In Figure 1, (10) is an image input part, which comprises an image reading device such as
an image scanner including a CCD sensor, and interfaces with external equipment such as a host
computer, an SV camera, a video camera, etc. The image data input from the image input part
(10) is fed to the input terminal (100) of the image memory part (11) illustrated in Figure 2. (12)
is an operating part where the operator designates the output destination of the image data, etc.,
and (13) is an output control part. The former selects the output destination for the image data,
while the latter outputs a synchronous signal (an ITOP signal from the output control part that
constitutes the printer engine along with the image output part, or connection information, for
instance, according to the manual key input from the operating part or corresponding to the
image output part (printer resolution) from the image output part). (125) and (126) respectively
indicate the output terminal of the image memory part and the input terminal of the synchronous
signal of the image memory part (11). (14) is an image display part such as a display, (15) is a
communications part that transmits and receives image data over telephone lines or a LAN, and
(16) is an image output part such as a laser-beam printer, which forms latent images by
irradiating a laser beam onto a photosensitive substance in order to form visual images.

[0013]

The image output part (16) may also be an inkjet printer, a thermal transfer printer or a dot
matrix printer.
[0014] 

Figure 2 is a block diagram that illustrates the detailed constitution of the image memory
part (11) illustrated in Figure 1. In the image memory part (11), the input image data is divided
into blocks, and the image data in each block is coded by the JPEG system in a manner that
depends on the image characteristic of each block.

[0015] 

In Figure 2, (21) is a DCT transform part which divides the input image data into block
units of 8 x 8 pixels and applies DCT for each block, (22) is a letter block identification part
which identifies whether the block contains letters based on the DCT transform coefficient
obtained from the DCT transform part (21), (23) is a quantization table switching part that
switches the quantization table according to the identification result by the letter block
identification part (22), (24) is a normalized quantization part that normalizes and quantizes DCT
transform coefficients, (25) is a Huffman coding part that Huffman codes quantized data, (26) is
a region identification bit coding part that codes the region identification bit, which is the
identification result from the letter block identification part (22), and (27) is an image memory
that stores coded data.

[0016] 

Next, let us explain the processing of the input image data in the image memory part (11).

[0017]

First, the image data input from the input terminal (100) is divided into block units of 8 x
8 pixels, and two-dimensional discrete cosine transform (DCT) is applied by the DCT transform
part (21). The DCT transform coefficients obtained by the DCT transform part (21) consist of
a direct current (DC) component and alternating current (AC) components. The AC components
consist of components of various frequencies, which are expressed as illustrated in Figure 3. In
Figure 3, each small square expresses one DCT transform coefficient. The components are of
lower frequencies toward the upper left, and higher frequencies toward the lower right. The
coefficient in the shaded area in the upper left indicates the direct current (DC) component, and the
other coefficients indicate alternating current (AC) components.
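
As a rough illustration of the block division and transform performed by the DCT transform part (21), the sketch below uses NumPy and an orthonormal DCT-II basis matrix. The function names are hypothetical, the image dimensions are assumed to be multiples of the block size, and the level shift that JPEG applies before the transform is omitted.

```python
import numpy as np


def dct_matrix(n=8):
    """Orthonormal DCT-II basis matrix C; the 2-D DCT of a block B is C @ B @ C.T."""
    k = np.arange(n).reshape(-1, 1)  # frequency index (rows)
    x = np.arange(n).reshape(1, -1)  # sample index (columns)
    c = np.sqrt(2.0 / n) * np.cos((2 * x + 1) * k * np.pi / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)       # the DC row has its own normalization
    return c


def split_into_blocks(image, n=8):
    """Reshape an (H, W) image into an (H/n, W/n, n, n) array of blocks."""
    h, w = image.shape
    return image.reshape(h // n, n, w // n, n).swapaxes(1, 2)


def block_dct(image, n=8):
    """Apply the 2-D DCT to every n x n block of the image."""
    c = dct_matrix(n)
    blocks = split_into_blocks(np.asarray(image, dtype=np.float64), n)
    return c @ blocks @ c.T          # matmul broadcasts over the block grid
```

In the result, index `[i, j, 0, 0]` holds the DC component of block (i, j), and the remaining 63 entries are its AC components, with frequency increasing toward index `[7, 7]` as in Figure 3.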

[0018] Now, the letter block identification part (22) identifies whether a block to which DCT has
been applied is a block containing letters (hereafter, this is called a letter block) or a block
containing natural objects or picture background (hereafter, this is called a natural object
+ background block) using the value of the transform coefficient obtained from the DCT
transform part (21).

[0019] 

In the present application example, the letter block is identified as follows: 

[0020] (1) As illustrated in Figure 4, of the DCT transform coefficients, the sum (S) of the absolute
values of the 20 AC components in the low frequency band indicated by the shaded part is
obtained.

[0021] (2) This sum (S) is compared with a prescribed threshold value (TH) (for instance, 500).

(3) If S > TH, the block is identified as a letter block, but if S < TH, it is identified as a natural
object + background block.
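
Steps (1) to (3) can be sketched as follows. This is only an illustration under stated assumptions: the exact 20 coefficients are defined by the shaded part of Figure 4, which is not reproduced here, so the sketch assumes the first 20 AC coefficients in zigzag order; the case S = TH, which the text leaves open, is treated as a natural object + background block. All function names are hypothetical.

```python
import numpy as np


def zigzag_order(n=8):
    """Return the (row, col) positions of an n x n block in zigzag scan order."""
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda rc: (rc[0] + rc[1], rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )


def low_frequency_sum(coeffs, count=20):
    """Sum S of absolute values of `count` low-frequency AC coefficients.

    Assumption: "low frequency band" means the first `count` AC positions
    in zigzag order; Figure 4 may define a different set.
    """
    positions = zigzag_order(coeffs.shape[0])[1:1 + count]  # skip the DC term
    return float(sum(abs(coeffs[r, c]) for r, c in positions))


def region_identification_bit(coeffs, threshold=500, count=20):
    """Return 1 for a letter block (S > TH), otherwise 0."""
    return 1 if low_frequency_sum(coeffs, count) > threshold else 0
```

For example, a block whose only significant AC coefficients are 300 and -250 in the two lowest-frequency positions gives S = 550 and is classified as a letter block with TH = 500, whereas a block with a large DC component and no AC energy gives S = 0.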

[0022] The identification results are output as region identification bits (if the value is "1," the 
block is a letter block, but if the value is "0," the block is a natural object + background block). 
Figure 5 visually illustrates the relationship between region identification bits and blocks. As 
illustrated in Figure 5, each block has one region identification bit that expresses whether it is a
letter block or a natural object + background block. 

[0023]

DCT transform coefficients are quantized by the normalized quantization part (24). 

[0024] As illustrated in Figure 6(b), quantization tables that have been commonly used in the
past have quantization weights for high-frequency components of DCT transform coefficients
that are much larger than those for low-frequency components. Thus, many
values in the high-frequency components tend to be cut off by the quantization process. This is believed
to be a cause of deterioration in picture quality at the edges of letters. Therefore, in the present
application example, for blocks identified as letter blocks, a quantization table such as that
illustrated in Figure 6(a) is used, which quantizes high-frequency components to a
finer degree and low-frequency components to a slightly rougher degree. For
natural object + background blocks, the quantization table illustrated in Figure 6(b) is used.

[0025] This switching of quantization tables is performed by the quantization table switching 
part (23) according to the identification result from the letter block identification part (22). In this 
manner, two quantization tables are used by selecting one of the two, thereby the deterioration in 
picture quality in areas with letters can be controlled. 
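
The table switching can be illustrated with a short sketch. The two tables below are hypothetical stand-ins, not the values of Figure 6: they only reproduce the stated relationship, namely that the letter table is slightly coarser at low frequencies and finer at high frequencies than the conventional table. The function names are likewise assumptions.

```python
import numpy as np

# Hypothetical weights that grow with the frequency index (u + v).
# These are NOT the tables of Figure 6; they only mimic their stated shape.
u, v = np.indices((8, 8))
NATURAL_TABLE = 16 + 6 * (u + v)  # conventional: steep growth toward high frequencies
LETTER_TABLE = 20 + 2 * (u + v)   # letters: coarser at low, finer at high frequencies


def select_table(region_bit):
    """Quantization table switching: 1 selects the letter table, 0 the conventional one."""
    return LETTER_TABLE if region_bit == 1 else NATURAL_TABLE


def quantize(coeffs, region_bit):
    """Normalized quantization: divide by the selected weights and round."""
    return np.rint(coeffs / select_table(region_bit)).astype(int)


def dequantize(levels, region_bit):
    """Inverse operation on the decoding side, using the same region bit."""
    return levels * select_table(region_bit)
```

With these stand-in tables, a high-frequency coefficient of 40 at position (7, 7) is quantized to 0 with the conventional table (weight 100) but survives as 1 with the letter table (weight 48), which is the effect the switching is intended to achieve around letter edges.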

[0026] Using the quantization table obtained in this manner, the DCT transform coefficient is
quantized by the normalized quantization part (24), and furthermore, the quantized data is coded
by the Huffman coding part (25). The data that is coded (coded data 1) is stored in the image
memory (27).

[0027] 

Meanwhile, the region identification bit is also input to the region identification bit 
coding part (26), where entropy coding (for instance, arithmetic coding, MMR, etc.) is executed, 
and the resulting coded data (coded data 2) is output to the image memory (27). In this manner, 
the increase in the amount of codes due to the generation of region identification bits can be 
minimized. 
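
The reason the region identification bits compress well is that neighboring blocks usually share the same type, so the bit map consists of long runs. The sketch below uses plain run-length coding as a simple stand-in to show this; it is not arithmetic coding or MMR, which the text names as examples, and the function names are hypothetical.

```python
from itertools import groupby


def run_length_encode(bits):
    """Collapse a raster-ordered sequence of region bits into (bit, run_length) pairs."""
    return [(bit, len(list(group))) for bit, group in groupby(bits)]


def run_length_decode(runs):
    """Expand (bit, run_length) pairs back into the original bit sequence."""
    return [bit for bit, length in runs for _ in range(length)]
```

A row of 20 blocks with one band of 4 letter blocks, for example, is represented by three runs instead of 20 separate bits.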

[0028] Therefore, according to the present application example, each of the divided blocks of
input image data is identified as either a letter block or a natural object + background block, and
according to the identification result, quantization is executed for each block using the optimum
quantization table. Thus, even if an image contains both letters and natural objects and their
backgrounds, deterioration in the picture quality of the areas with letters due to coding can be
controlled.

[0029] Now, in the present application example, as the size of the blocks that divide the input
image data, 8 x 8 pixels were used, and DCT was used as the orthogonal transform applied to the
data of each block. The present invention, however, is not limited to this example. For instance,
larger or smaller blocks may be used, and Hadamard transform, etc. may be used as the
orthogonal transform.

[0030] Also, in the present application example, to identify letter blocks from DCT transform
coefficients, the sum of 20 absolute values of AC components that belong in the low frequency
band was used. The present invention, however, is not limited to this example. For instance,
coefficients of other frequency components may be used, and the number of coefficients used
does not have to be 20. Besides, instead of a sum of absolute values, for instance, a value that can
be an index may be obtained using root mean squares.
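
The two kinds of index mentioned here can be compared with a small sketch. The function name and the `method` parameter are hypothetical; note that the two indices are on different scales, so a threshold chosen for one (such as 500 for the sum of absolute values) would have to be re-tuned for the other.

```python
import math


def block_index(values, method="abs_sum"):
    """Compute an identification index from the selected AC coefficients."""
    if method == "abs_sum":
        return sum(abs(v) for v in values)
    if method == "rms":
        # Root mean square of the selected coefficients.
        return math.sqrt(sum(v * v for v in values) / len(values))
    raise ValueError(f"unknown method: {method}")
```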

[0031] Furthermore, if the letter block region and the natural object + background block region
are structured and pre-divided as illustrated in Figure 7, the device may be constituted so that
they will be divided into the part (a) (letter block region) and the other part in a preliminary
process by manual input or some other process, so that the quantization steps will be switched for
each region as in the case of the present application example.

[0032]

Furthermore, in the present application example, two quantization tables such as
illustrated in Figure 6 were respectively used for the quantization of letter blocks and natural
object + background blocks. However, naturally, other quantization tables may also be used.

[0033] 

Furthermore, in the present application example, the blocks were classified into two
types, i.e., letter blocks and natural object + background blocks, using the value of the transform
coefficients obtained from the DCT transform part (21), and the quantization tables were selected
according to type for quantization. However, the present invention is not limited to this example.
For instance, as illustrated in Figure 8, one quantization table (33) may be provided, a certain
constant (step multiplier) may be obtained for each letter block or natural object + background block
identified in the letter block identification part (22), the quantization weights obtained from the
quantization table (33) may be multiplied by the step multiplier in the multiplication part (32), and
quantization may be executed based on the result of this multiplication. In this case, if the step
multipliers are determined to be "0.5" for letter blocks and "1.0" for natural object + background
blocks, the quantization weights for letter blocks will be smaller, and as a result, deterioration in
the picture quality of areas with letters will be controlled.
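
The single-table variant of Figure 8 can be sketched as below. The multipliers 0.5 and 1.0 are the ones given in the text; the base table passed in is left to the caller because the contents of quantization table (33) are not given, and the function names are hypothetical.

```python
import numpy as np

# Step multipliers from the text: 0.5 for letter blocks (bit 1), 1.0 otherwise (bit 0).
STEP_MULTIPLIER = {1: 0.5, 0: 1.0}


def scaled_weights(base_table, region_bit):
    """Multiplication part: scale the single base table by the block's step multiplier."""
    return np.asarray(base_table, dtype=np.float64) * STEP_MULTIPLIER[region_bit]


def quantize_with_multiplier(coeffs, base_table, region_bit):
    """Quantize using the scaled weights instead of switching between two tables."""
    return np.rint(coeffs / scaled_weights(base_table, region_bit)).astype(int)
```

With a base weight of 20, for instance, a coefficient of 8 is quantized to 0 in a natural object + background block but to 1 in a letter block, where the effective weight is 10. Unlike the two-table scheme, this scales all frequencies uniformly, so only one table has to be stored.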

[0034] In addition to this, the block classification by the letter block identification part (22) does
not have to consist of two types. Assuming the region identification bit output from the letter
block identification part (22) has a 2-bit structure, four types of blocks, for instance,
letter block a, letter block b, letter block c and natural object + background block, may be
identified, and the step multipliers for these may be respectively set to "0.4," "0.6," "0.8," and
"1.0." Then the same type of process may be executed.

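A four-class version might look like the sketch below. The multipliers are the ones given in the text, but the text does not say how the four classes are distinguished or how they map to 2-bit codes; the three thresholds on the sum S and the code assignment used here are purely hypothetical.

```python
import bisect

# Hypothetical thresholds on the low-frequency sum S (not given in the text).
THRESHOLDS = [500, 1000, 1500]

# Step multipliers from the text, indexed by a hypothetical 2-bit class code:
# 0 = natural object + background, 1 = letter block c, 2 = letter block b, 3 = letter block a.
MULTIPLIERS = [1.0, 0.8, 0.6, 0.4]


def classify(s):
    """Return the 2-bit class code: the number of thresholds that S strictly exceeds."""
    return bisect.bisect_left(THRESHOLDS, s)


def step_multiplier(s):
    """Look up the step multiplier for a block from its low-frequency sum S."""
    return MULTIPLIERS[classify(s)]
```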
[0035] Furthermore, in the present application example, DCT transform coefficients were used
to identify the type of blocks. The present invention, however, is not limited to this example. For
instance, as illustrated in Figure 9, the input image data may be directly input to the letter block
identification part (22) to execute the identification process. Specifically, the image data may be
divided into blocks of m x n pixels, and block identification may be made by [...] the
threshold value. Here, the block of m x n pixels does not necessarily have to match the block
[...] of (maximum - minimum); for instance, the variance within the block may also be used.
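
A pixel-domain identification of this kind could be sketched as follows. Because part of the paragraph is illegible, this is an assumption about the intended procedure: an index computed directly from the pixel values of an m x n block, either (maximum - minimum) or the variance, is compared with a threshold. The function names and threshold values are hypothetical.

```python
import numpy as np


def spatial_index(block, method="range"):
    """Index computed directly from pixel values, without any transform."""
    block = np.asarray(block, dtype=np.float64)
    if method == "range":
        return float(block.max() - block.min())  # (maximum - minimum)
    if method == "variance":
        return float(block.var())
    raise ValueError(f"unknown method: {method}")


def is_letter_block(block, threshold, method="range"):
    """Assumed rule: a block whose index exceeds the threshold is a letter block."""
    return spatial_index(block, method) > threshold
```

A flat background block has a range and variance of 0, while a block crossed by a dark character stroke on a light background has a large range, so the two are separated by any threshold in between.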

[0036] Now, the present invention may be applied to a system comprising two or more devices,
or it may be applied to a system comprising one device. Needless to say, the present
invention may be applied to a case wherein it is realized by feeding a program to a system or
a device.
[0037] 

Effect of the invention

As explained above, according to the present invention, the input image data is divided
into several prescribed block units, orthogonal transform is done on each block, the image
characteristic of each block is distinguished from the transform coefficient to which the
orthogonal transform is applied, the optimum quantization weights are obtained according to the
result of the distinction, and the aforementioned transform coefficient is quantized. Thus, for
instance, even with an image containing both letters and natural objects and their backgrounds,
optimum quantization is processed, and deterioration in the picture quality can be effectively
controlled.

Brief description of the figures

Figure 1 is a block diagram that illustrates the scheme of the entire constitution of the
image processing device, which is a representative application example of the present invention.

Figure 2 is a block diagram that illustrates the detailed constitution of the image memory
part (11) illustrated in Figure 1.

Figure 3 is a diagram that illustrates the structure of DCT transform coefficient.

Figure 4 is a diagram that illustrates the DCT coefficient used for letter block
identification.

Figure 5 is a diagram that conceptually illustrates the relationship between region
identification bits and blocks.

Figure 6 is a diagram that illustrates the constitution of the quantization table. 

Figure 7 is a diagram that illustrates an example of an image wherein the image regions 
are pre-structured according to the type of image. 
Figure 8 is a block diagram that illustrates another example of the detailed constitution of
the image memory part (11) illustrated in Figure 1.

Figure 9 is a block diagram that illustrates still another example of the detailed
constitution of the image memory part (11) illustrated in Figure 1.

Figure 10 is a diagram that illustrates the processing flow according to a conventional 
image compression coding system. 



Explanation of symbols 

21 DCT transform part 

22 Letter block identification part 

23 Quantization table switching part 

24 Normalized quantization part 

25 Huffman coding part 

26 Region identification bit coding part 

27 Image memory 